Optimal local implementation of non-local quantum gates 
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We investigate the minimal resources that are required in the local implementation of non-local quantum 
gates in a distributed quantum computer. Both classical communication requirements and entanglement con- 
sumption are investigated. We present general statements on the minimal resource requirements and present 
optimal procedures for a number of important gates, including CNOT and Toffoli gates. We show that one bit of 
classical communication in each direction is both necessary and sufficient for the non-local implementation of 
the quantum CNOT, while in general two bits in each direction is required for the implementation of a general 
two bit quantum gate. In particular, the state-swapper requires this maximum classical communication over- 
head. Extensions of these ideas to multi-party gates are presented. 
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I. INTRODUCTION 



A quantum computer [1-3] allows, in principle, for the effi- 
cient solution of some problems that are intractable on a clas- 
sical computer, the most striking example being the factor- 
ization of large numbers [4,5]. However, the practical prob- 
lems involved in the actual construction of a quantum com- 
puter of an interesting size (certainly more than 50 qubits) 
that is capable of performing a sufficiently large number of 
logical gates (a few hundred appear as a lower limit for an in- 
teresting problem involving 50 qubits) are daunting. Problems 
range from fundamental effects such as decoherence and dis- 
sipation, experimental imperfections for example in the tim- 
ing, length and intensity of the laser pulses to the non-trivial 
task of storing and isolating reliably a large number of qubits 
[3,6-8]. In fact, in proposals such as ion trap or the cavity 
QED implementations it seems problematic to store and pro- 
cess very large numbers of qubits in a single 'processor'. A 
possible way out would be the construction of a quantum com- 
puter not as a local device that contains all qubits in a single 
processor, but to build it from the outset as a multi-processor 
device where each processor contains only a small number of 
qubits. Such a 'distributed quantum computer' can be viewed 
as a generalization of a quantum communication network in 
which each node can act as a sender or receiver and contains 
only a small number of qubits. Distributed quantum com- 
putation has been considered previously by Grover [9], and 
he demonstrated that the solution of a phase estimation prob- 
lem can be obtained efficiently with such a device assuming 
ideal conditions. It was later shown, that even under non-ideal 
conditions, i.e., in the presence of decoherence, a distributed 
quantum computer can be superior to a classical computer in 
terms of the resources that are required for the solution of 
the phase estimation problem [10]. However, these investiga- 
tions considered the specific problem of phase estimation and 
did not address the question of universal quantum computa- 
tion. Before one is able to consider the physical resource effi- 
ciency of a distributed quantum computer in general, it is nec- 
essary to establish first optimal implementations of quantum 



gates between qubits that are located in different nodes of the 
distributed quantum computer. This problem is addressed in 
this paper. We present optimal protocols implementing gates 
that affect qubits in different nodes (here dubbed non-local 
gates) only using local operations and classical communica- 
tion (LOCC) and previously shared entanglement. Optimality 
is measured in terms of the consumption of the basic exper- 
imental resources of entanglement and classical communica- 
tion between nodes. We present general theorems that give 
lower bounds on the resources required for the implementa- 
tion of quantum gates and for several universal quantum gates 
we present optimal implementations. We also discuss the gen- 
eral structure of the classical communication transfer in these 
implementations . 

It should be noted that the issue addressed in the present 
paper is different from the question as to whether (and how) a 
particular entanglement transformation is possible under local 
quantum operations and classical communication [11] in that 
in the course of the non-local implementation of a quantum 
gate the initial state is not known in advance. Instead, with 
the use of shared entanglement particular joint unitary opera- 
tions between several parties are simulated. 

In Section II we begin with an investigation of two-qubit 
gates. We establish some lower bounds on the resources that 
are required to implement two-qubit gates and present optimal 
implementations for a number of important gates. In particu- 
lar we present a protocol that implements a CNOT gate con- 
suming one ebit of entanglement and using only one classical 
bit of communication between the two parties. We then pro- 
ceed in Section III to study multi-party gates such as Toffoli 
gates and other more general multi-party quantum gates again 
presenting bounds on the required physical resources and op- 
timal protocols for some important classes of gates. 



II. NON-LOCAL TWO-QUBIT GATES 

General single-bit rotations together with a CNOT gate are 
sufficient to implement any multi-qubit unitary transforma- 
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tion. This implies that the resource requirements for the im- 
plementation of a CNOT gate are a limiting factor in the con- 
struction of general unitary transformations in a distributed 
quantum computer. For this reason we investigate first the 
CNOT gate. 

Theorem 1 One bit of classical communication in each di- 
rection and one shared ebit is necessary and sufficient for the 
non-local implementation of a quantum CNOT gate. 

Proof: (i) Necessity: To demonstrate that one bit of commu- 
nication in each direction is necessary we first note that the 
procedure consists of local operations and classical communi- 
cation. As local operations cannot transmit information from 
Alice to Bob, or vice versa, all information which has been 
sent at the end of the operation must have been sent classi- 
cally. Consider now the CNOT quantum gate. If the target 
qubit is initialised in the state |0), then its final state will be 
|0) or |1) depending on the initial state of the control qubit 
being |0) or |1) respectively. Therefore, the final result of the 
gate in this case is the communication of one bit of informa- 
tion from Alice (holding the control qubit) to Bob (holding the 
target qubit). Consequently, in the non-local implementation, 
one bit of classical information must have been sent classi- 
cally from Alice to Bob. The reason for this can be seen from 
an elegant argument presented in the figure caption of the last 
figure in [12] (see [13] for more details). In short, assume that 
Alice needs to send less than one bit. In that case she could 
omit sending the bit and force Bob to make a guess. As he 
would guess the correct answer with a probability larger than 
1/2, Alice and Bob could then use error correction codes to 
establish a perfect channel and would end up with a superlu- 
minal communication channel. To see that one bit must also 
have been sent from Bob to Alice, we need merely note that in 
the basis |±) = (|0) ± |l))/\/2 the role of control and target 
in a CNOT gate are reversed. Consequently, if Alice's parti- 
cle is prepared in the standard state |+) and Bob chooses to 
prepare his particle either in state |+) or |— ), Alice will, af- 
ter the application of the CNOT gate, hold a particle which is 
either in state |+) or |— ) depending on the state Bob's par- 
ticle has been prepared in. Therefore one bit of information 
has been transmitted from Bob to Alice. As the implementa- 
tion of the CNOT must be independent of the initial state, the 
procedure must allow for one bit of communication in each 
direction, and as a consequence the non-local implementation 
must involve, as a minimum, one bit of communication in both 
directions. 

That one ebit is required can be seen from the fact that 
a CNOT gate acting on the initial state [\Q) A + \1}a)\0)b 
leads to a maximally entangled state (|00) J 4B + |11)ab- As 
the amount of entanglement cannot be increased by local op- 
erations, this implies that the non-local implementation of a 
CNOT gate must consume at least one ebit. 

(ii) Sufficiency: In the following we construct a quantum 
circuit which performs the CNOT non-locally using one e- 
bit and the transmission of one classical bit in each direction. 



This quantum circuit is given in figure 1 . The CNOT is per- 
formed between the qubits A and B. Alice holds the qubits A 
and A\, and Bob holds the qubits B and B\. The wavy line 
connecting A\ and B\ signifies that they are entangled. In par- 
ticular we will choose their initial state to be ( |00) + 1 1 1) ) /V2. 
The initial state of A is necessarily arbitrary, and so is given 
bya|0)A + (3\1}a- The initial state of B is also arbitrary, and 
is given by 7|0)b + 5\1)b- Time now flows from left to right 
in figure 1 . First a local CNOT is performed with A as the 
control and A\ as the target. After this the combined state of 
A, Ai and B x is 



4=(a|000) + o|011) + /3|110) + /3|101))aa iBi . 
V2 



(1) 



Alice then performs a measurement on A\ in the computa- 
tional basis, and the line corresponding to this qubit termi- 
nates. The result of the measurement is one bit of information, 
which is communicated to Bob, and this communication is de- 
noted by the dashed line. If the result is |0) Bob does nothing, 
and if the result is 1 1) Bob performs the not operation. At this 
point the combined state of A and B\ is a\00) ab 1 +0\H) AB!- 
That is, we have now effectively performed a CNOT between 
A and B\, in which the initial state of 2 was |0). Now parti- 
cle Bi contains the necessary information about the state of 
A. We can now perform a CNOT between B\ and B. The 
combined state of A, B\ and B is now 

-^=(a 7 |000) + a<5|001) + /W|110) + ^\\\\))ab^b- 

All we have to do is to remove B\ from the state. This is done 
by performing a Hadamard transformation on B\, and then 
measuring B\ in the computational basis, at which point the 
line denoting Bi terminates. The result of the measurement 
(one bit) is communicated to Alice. If the result is '0' Alice 
does nothing, and if the result is '1' she performs a (state- 
independent) u z operation on particle A. This completes the 
non-local CNOT. ■ 



(2) 
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FIG. 1. A quantum circuit to perform the CNOT non-locally with 
minimal classical communication. Alice has the qubits A and A\, 
and Bob has B and B\. Alice and Bob are only allowed to com- 
municate classically, and this communication is represented by the 
dashed lines. Each dashed line denotes one bit of communication. 

Theorem 2 A control-U gate can be implemented using one 
shared ebit and one bit of classical communication in each 
direction. 
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Proof: A control-U gate is defined as a gate that applies the 
identity on the target qubit if the control bit is in state |0) and it 
applies the unitary operator U to the target if the control qubit 
is in state The same quantum circuit as in Fig. 1 can be 
used except that the CNOT gate on Bobs side is replaced by a 
control-U gate. ■ 

In general a single application of a control-U gate cannot 
be employed to create one e-bit from an initial product state of 
two qubits. Furthermore, the amount of classical information 
that can be sent from Alice to Bob via a general control-U 
gate is less than one bit. This raises the question as to whether 
such a control-U gate can be implemented with less resources 
than a full ebit and one classical bit of communication in each 
direction. Clearly this will not be possible when we only wish 
to implement a single instance of a control-U gate. However, 
it may be conceivable that one has a situation in which one 
needs to carry out a large number of control-U gates simulta- 
neously. In that case it is conceivable that this could be done 
with less than 1 ebit of entanglement per gate and less than 
one bit of classical communication in each direction. How- 
ever, this turns out to be a difficult question and we have been 
unable to find such a scheme. 

Let us now move on to investigate general two-qubit quan- 
tum gates to establish the minimum resource requirements for 
their implementation. 

Theorem 3 Two bits of classical communication in both di- 
rections and two shared ebits is sufficient for the non-local 
implementation of a general two-bit gate. 

Proof: To demonstrate that this amount of communication 
is sufficient to implement all quantum operations we need 
merely invoke quantum teleportation. Any operation may be 
performed by teleporting Alice's state to Bob, at which point 
Bob may locally perform the operation, and then teleport the 
resulting state back to Alice. This procedure requires two bits 
of communication in each direction and 2 shared ebits [12,16] 
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FIG. 2. A state swapper implemented by means of three quantum 
CNOT gates. 

Moreover, there are two-qubit gates that require two bits 
of classical communication in each direction and consumes 2 
bits. An example is the state-swapper, which may be writ- 
ten as three CNOT gates, one after the other, with Alice as 
the control, target, and then control, in that order (see Fig. 
2) . To show that two bits of classical communication are 
required (each way) in the non-local implementation of this 



gate, we need to show that this amount of information may 
be communicated from Alice to Bob (and vice versa) when 
the gate is performed. To do this we merely have to note that 
at the completion of the gate Alice has sent her state to Bob. 
Now, this state could have been initially in a maximally en- 
tangled state with a qubit that Bob possesses. Dense coding 
tells us that this enables Alice to send two bits of information 
to Bob [17]. Naturally, Bob can use the same procedure to 
send two bits of information to Alice. Therefore, in a non- 
local implementation, the state swapper requires at least two 
bits of communication in each direction. An analogous ar- 
gument shows that the state swapper would also require two 
shared ebits, as a state swapper can be used to establish two 
ebits from a product state. To achieve this one simply ap- 
plies the state swapper to particles A 2 and B 2 of the state 
(|00>^ 2 + |ll) Al ^ 2 )(|00) SlS2 + |ll) BlBa ). 

It is remarkable that the swap gate requires only two shared 
ebits as it can be shown that three CNOT gates are necessary 
to implement it when one employs the ordinary gate array pic- 
ture using a universal set of quantum gates that is made up of 
CNOT gates and local unitary operations [18]. This observa- 
tion may be useful, as it demonstrates that in some cases the 
use of entanglement can be replaced partially by local mea- 
surements and classical communication. 

Before we move on to investigate the implementation of 
non-local multi-party gates we would like to analyze the struc- 
ture of the classical information transfer involved in the gate 
implementation somewhat further. In both examples dis- 
cussed above it turned out that the classical information trans- 
fer between the two parties is symmetric, i.e., the same num- 
ber of bits need to be sent from Alice to Bob and vice versa. 
Likewise, the amount of classical information that can be sent 
using these two-qubit gates is also the same in each direc- 
tion. It is therefore quite natural to ask whether this is the case 
in general. Indeed we have not been able to find a counter- 
example and we therefore make the following two closely re- 
lated conjectures. 

Conjecture 4 The minimal amount of classical communica- 
tion required to implement any two-party quantum gate with 
one qubit associated with each party and shared M ebits, 
M — 1,2, is always the same in each direction. 

Conjecture 5 The amount of classical information that can 
be sent via any two-qubit gate is the same in each direction. 

While these conjectures appear natural, we have not been 
able to find general proofs for them. However, we have 
been able to confirm both of them for a number of classes 
of two-qubit quantum gates. An example of a gate which has 
the same classical information capacity in both directions is 
the CNOT gate whose optimal implementation has been de- 
scribed above. How can we see that a quantum gate is sym- 
metric with respect to its capability for classical information 
transfer? Before we move on to the most general case, let us 
consider the CNOT gate. Imagine we have the ability to per- 
form a CNOT gate with Alice as the control and Bob as the 
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target. Using this gate and local operations only, we can then 
also implement a CNOT with Alice as a target and Bob as a 
control, simply by applying a Hadamard gate to each qubit 
both before and after the CNOT, see Fig. 3. 
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FIG. 3. A CNOT gate, with A as control and B as target, sur- 
rounded by Hadamard gates is equivalent to a CNOT gate with A as 
target and B as control. 

The two versions of the CNOT gate are also related via the 
(nonlocal) state swapper. 



TjBA _ tj TjAB rrf 
^CNOT — ^ss^CNOT^ss 
AB 



= (H ® H)U$? m (H ® H 



(3) 



where C/crrar re P resent s the CNOT gate with A as a control 
and B as a target and U ss denotes the state swapper. In gen- 
eral if we can achieve the transformation Uba = U ss UabUI s 
from Uab an d purely local operations, i.e., if there exist local 
one-qubit unitary operators U\, U2, U3 and U4 for which we 
have 



U BA = U SS U AB U\ S 

= (Ui ® U 2 )U AB (U 3 ® Ua) 



(4) 



then Eq. (4) is a sufficient condition for the classical informa- 
tion transmission capacities in each direction to be equal. In 
the following we will determine some sets of quantum gates 
Uab for which Eq. (4) holds. 

Let us begin with a slightly simpler problem. Suppose that 
we have a two-qubit quantum gate V\ £ U (4). V\ can be ex- 
pressed in terms of its generator as V\ — exp(ii?i), where the 
generator Hi is a Hermitean operator. We now define another 
quantum gate V2 as 



V2 = U ss V 1 Ut = U ss e iH 'Ul 



= e 



= e 



(5) 



where the generator H2 of V2 is clearly a Hermitean operator. 
Our goal can therefore be reformulated as: For which unitary 
operators Vi can we write V 2 as V 2 = (U\ ® U2)Vi (Jj\® U\), 
or equivalently for which generators Hi of V\ can we write 



H 2 = U ss HiUl = (Ui ® U 2 )Hi{U\ ® U\). 



(6) 



Note that this is less general than the transformation in Eq. (4). 
It is useful to realize that both the unitary operator Vi and its 
generator Hi are diagonal in the same basis, say {| fc) , i — 
1, 2, 3, 4}. Furthermore, we can decompose Hi with re- 
spect to its eigenvectors as Hi = J2i K I <Pi) (4>i\ = Z)» ^*P»> 
where Aj is the eigenvalue of Hi corresponding to the eigen- 
vector \4>i). Consequently, Eq. (6) becomes 



We can now prove a number of lemmas. We begin with 

Lemma 6 Any two-qubit quantum gate that has a generator 
with a single non-vanishing eigenvalue is symmetric with re- 
spect to its classical information transfer capacity. 

Proof: Suppose that the only non-vanishing eigenvalue of the 
generator Hi is Ai [15]. In that case we can always find one- 
qubit unitary operators Ui and U2 such that Eq. (7) holds. To 
see this, note that the eigenstate | fa) is actually a pure state 
describing a system composed by two qubits. Therefore, it has 
the Schmidt decomposition | (pi) = J2k VPk \ I = 
J2k y/Pk I k) I Furthermore, in this case we have 



Y u ss I fa) ( ul = \iY Q I k) (l\ (l\ 

i k,l 

= (U®U) \ Y X iVpWi\k)\k)(l\(l\) (LTtgc^t) 



(U®U)[Y*i\<l>i)(<t>i\](U*®Ut), 



(8) 



where U is defined to be the unitary operator which maps each 
basis vector | i) to its corresponding | i) . Similarly, the unitary 
operator U maps each basis vector | i) to its corresponding | i), 
i.e., U = W. M 

Another non-trivial class of quantum gates Ubd for which 
condition (6) holds, is the one whose generator is Bell diago- 
nal, i.e., we have 

Lemma 7 Any two-qubit quantum gate that has a generator 
which is Bell-diagonal is symmetric with respect to its classi- 
cal information transfer capacity. 

Proof: If I vp) is any of the Bell states, the reader can easily 
verify that 

|*)<*| =U SS \^){^\U\ S = {<r z ®a z )\^){^\(a z ®a z ) 

Therefore, for the quantum gate Ubd, condition (6) is satisfied 
by either choosing Ui = U2 = 1 or Ui = U2 = cr z . Re- 
call that g z is the Pauli matrix corresponding to the arbitrarily 
chosen z direction. ■ 

Note however, that condition (6) is not satisfied for all 
quantum gates Uab- A counterexample is the gate 



[/4B = e lAl |0+)(0+|+e iA2 |0-)(0 
+ e a3 |10)(10| + e lA4 |ll)(ll| 



(9) 



For Ai = A 2 = and non-trivial choice of A 3 and A 4 it is not 
possible to find local unitary operators Ui and U2 such that 
Eq. (6) is satisfied. Nevertheless, it is possible to find local 
unitary operators Ui, U2, U3 and U4 which satisfy the more 
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general condition (4). The local unitary operators will be of 
the form [14]: 

[7i = e~ iA4 | 1) < 1| + e i(A 3 -A 4 ) | o) ( 0| , (10) 

[/ 2 = |l)(l|+ e -^- A ^|0)(0|, (11) 

U 3 = 1, (12) 

[/ 4 = e lA4 |l>(l| + |0)(0|. (13) 

We can then conclude to the following lemma: 

Lemma 8 The amount of classical information that can be 
sent via any control-U gate of the form 

U = | 0) (0| ® 1 + 1 1) ( 1| ® (e iAa | 0) ( 0| + e 1 ^ | 1) ( 1|) 

is the same in each direction. 

It should be noted that this does not mean that the amount of 
information transferred in any particular operation of the gate 
will be the same in both directions, as this will depend upon 
the choice of initial states. However, an implementation of the 
gate must work for all possible initial states, (in particular it 
must work for the case where both qubits are pure and there- 
fore contain their maximum capacity), and this is what puts 
the limit on the minimal communication requirement. 

It is clear that we may now put 2-bit quantum gates into two 
classes. Those which require no more than one bit of two-way 
communication, and those that require more than one bit (but 
no more than two bits). The CNOT falls into the first cate- 
gory, and the state-swapper falls into the second. Two other 
standard gates which fall into the first category are the c-U 
(which performs a unitary transformation on one system de- 
pending on the state of the other), and the state-preparer. 

III. NON-LOCAL MULTI-PARTY GATES 

In the previous section we have presented a number of re- 
sults concerning the implementation of non-local two-qubit 
quantum gates in a distributed quantum computer. In the fol- 
lowing we will generalize these ideas to local implementation 
of multi-qubit gates, i.e., gates where more than two parties 
are involved. To illuminate the system behind the construc- 
tion, we explain the implementation of the Toffoli gate from 
which the generalization to other multi-party gates will be ev- 
ident. 

Theorem 9 Two shared ebits and a total of four bits of clas- 
sical communication are necessary and sufficient for the lo- 
cal implementation of a non-local three-party quantum Toffoli 
gate. 

Proof: (i) Necessity: A Toffoli gate can be reduced to an 
ordinary CNOT gate when one fixes the state of one of the 
control qubits to be Chose the state of party A to be 
Then the initial state is 



and after the application of the Toffoli gate we find 

IV'ini) = |1)a(o7|00) + aJ|01) + /3 7 |11> + PS\10)) BC , 

(15) 

which shows that we have implemented a CNOT between par- 
ties B and C. Therefore, Theorem 1 implies that one classical 
bit has to be exchanged in both directions between A and the 
target party C and one ebit has to be shared between them. 
The same argument applies when we fix the state of qubit B 
tobe|l). 

Sufficiency. The implementation of the Toffoli gate with 
these minimal resources is presented in Fig. 4. Assume that 
Alice and Clare share a pair A\, C\ of qubits in a maximally 
entangled state \<j>+) = (|00) + |11))/V2, and that Bob and 
Clare share another pair of particles B\ and C2 in the same 
state. Then the initial state of the whole system consisting of 
particles A, B, C, Ai, B\, C\ and C2 is of the form 

|V) = W) A ® \ib) B ® |V)c ® \^ + )a iCi ® \4> + )b iC2 , (16) 



where 



|V>A = a|0>+/?|l}, 
H)b =7|0) + <5|1>, 
|V>c = r?|0)+£|l>. 



(17) 
(18) 
(19) 



The first step is a local quantum CNOT gate on A and A\ with 
A as control. Then Alice measures particle A\ and Clare per- 
forms a NOT operation on her particle C\ if Alice finds |1) 
and the identity if Alice finds |0). Qubit A\ is subsequently 
discarded. Now Bob applies a local CNOT with B being the 
control and B\ being the target. Then Bob measures particle 
B\ and Clare performs a NOT operation on her particle C2 
if Bob finds |1) and the identity if Bob finds |0). Qubit B x is 
subsequently discarded. Now the state of the remaining qubits 
A, B, C, C\ and C2 is given by 

H00) + /3|ll)) ACl ® ( 7 |00> + S\11)) BC2 ® |V)c. (20) 




|Vini) = |l)A(a|0)+/3|l))( 7 |0)+<5|l)), 



(14) 



FIG. 4. A quantum circuit for the non-local implementation of a 
Toffoli gate. 
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In a further step Clare applies locally a Toffoli with C\ 
and Ci being the control qubits. Subsequently Clare applies 
Hadamard gates to the qubits C\ and C 2 . Then she measures 
C2 and applies a z or the identity 1 to B if her result is 1 1) or 
|0) respectively. Finally she measures C\ and applies a z or 
the identity to A if her result is |1) or |0) respectively. This 
completes the Toffoli gate. ■ 

The total number of classical bits which have to be commu- 
nicated is four, and only two shared ebits of entanglement are 
consumed. Again, these results can be generalized to three- 
party control-U operations that can be represented in matrix 
form with respect to the computational basis as 



le6 



where 



"00 "01 
UlO tin 



«00 "01 
UlO Mil 



(21) 



(22) 



is the matrix representation of a unitary operator U. We only 
need to replace the local Toffoli gate by a local three-party 
control-U. This gives rise to 

Lemma 10 A three party control-U gate can be implemented 
using four bits of classical communication and two shared 
ebits. 

Using Theorem 9 and Lemma 10 we are now in a position to 
construct every possible quantum gate array using only ebits, 
classical communication and local operations. In particular 
one could use the results in [19] to construct A-party con- 
trolled gates from CNOTs and single bit rotations. This, how- 
ever, is not optimal in terms of physical resources. While it 
will be difficult to construct the optimal procedure for gen- 
eral quantum gates, for some gates we are able to find these 
procedures. We find for example 

Theorem 11 An N party control-U gate can be implemented 
using 2 (TV — 1) bits of classical communication and N — 1 
shared ebits. 



basis. If the outcome is |1), then a z is applied to the qubit 
Pk\ if the outcome is |0) then no action is taken on qubit Pk- 
Qubit T fe is subsequently discarded. Hence, the total required 
resources are 2(A — 1) bits of classical information and N—l 
initially shared ebits. ■ 

The amount of consumed resources in the latter protocol is 
rather surprizing. In an inefficient non-local implementation 
of the above A-party gate one could employ the simulation of 
the gate with the use of two-party control-U gates and CNOT 
gates as in Ref. [19], but such that each step is realized non- 
locally. In such a procedure a supply of 3 x 2 N ~ 1 — 4 ebits 
would be necessary. 




¥1° 



FIG. 5. A quantum circuit for the non-local implementation of an 
iV-party control-U gate. 

A more efficient teleportation-based protocol [20] in which 
the respective states of the qubits at different nodes are twice 
teleported would still use 2(N - 1) ebits and 4(A - 1) bits of 
classical information. 



Proof: The control parties are enumerated from Pi to Pjv-i 
and each of them is carrying one ancilla numerated by P[ to 
P' N _ V The target qubit is denoted by T and the target party 
possesses A — 1 further ancillary qubits. The first A — 1 steps 
of the protocol are essentially analogous. In the k-th step a 
local quantum CNOT gate is applied on Pk and P' k with Pk 
as control. Then this party measures particle P' k and the target 
party performs a NOT operation on her ancillary qubit Tk if 
Alice finds |1) and the identity if Alice finds |0). Qubit P k is 
subsequently discarded. Now we apply an A-party controlled 
U gate on Clares particles, with the ancillas C\, . . . , Cjv-i 
being the control qubits and T the target. Subsequently the 
target party performs Hadamard gates on each of its ancillas. 

This is then followed by A — 1 steps involving measure- 
ments. In the k-th step qubit T k is measured in the |0), |1) 



IV. CONCLUSIONS 

In this work we have addressed the problem of the local 
implementation of non-local gates in a distributed quantum 
computer, i.e. a computer which is composed of many sub- 
units (local processors). Such a configuration may be useful, 
as it requires only a small number of qubits (e.g. ions) to be 
stored at each site which may be experimentally more feasible 
than storing a large number of qubits in a single site. How- 
ever, this raised the issue of the non-local implementation of 
quantum gates. We have addressed this question and have 
shown what the minimal resources for the implementation of 
two-qubit quantum gates are. We have presented explicit op- 
timal constructions for the local implementation of non-local 
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control-U gates. We have generalized these results to multi- 
party gates such as for example the Toffoli gate. We have also 
adressed some issues concerning the structure of the informa- 
tion exchange that is required in these implementations. We 
hope that this work will be useful for the assessment of the 
viability of distributed quantum computation. 

We acknowledge useful discussion with Daniel Jonathan 
and John Vaccaro. This work was supported by the Deutsche 
Forschungsgemeinschaft (DFG), the UK engineering and 
physical sciences research council (EPSRC), The Leverhulme 
Trust, the European Science Foundation (ESF) programme on 
quantum information processing, the EQUIP programme of 
the European Union and the State Scholarships Foundation of 
Greece. 

Endnote: During completion of this work we became aware 
of the closely related work by D. Collins, N. Linden, and S. 
Popescu, Phys. Rev. A 64, 032302 (2001), quant-ph/0005 102. 
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